AITopics | safety property

Collaborating Authors

safety property

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

448d5eda79895153938a8431919f4c9f-Supplemental.pdf

Neural Information Processing SystemsFeb-8-2026, 06:15:26 GMT

benchmark, environment interaction, obstacle, (15 more...)

Neural Information Processing Systems

Industry: Transportation (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.48)

Add feedback

Property-Guided Cyber-Physical Reduction and Surrogation for Safety Analysis in Robotic Vehicles

Sayom, Nazmus Shakib, Garcia, Luis

arXiv.org Artificial IntelligenceDec-3-2025

We propose a methodology for falsifying safety properties in robotic vehicle systems through property-guided reduction and surrogate execution. By isolating only the control logic and physical dynamics relevant to a given specification, we construct lightweight surrogate models that preserve property-relevant behaviors while eliminating unrelated system complexity. This enables scalable falsification via trace analysis and temporal logic oracles. We demonstrate the approach on a drone control system containing a known safety flaw. The surrogate replicates failure conditions at a fraction of the simulation cost, and a property-guided fuzzer efficiently discovers semantic violations. Our results suggest that controller reduction, when coupled with logic-aware test generation, provides a practical and scalable path toward semantic verification of cyber-physical systems.

artificial intelligence, safety property, verification, (17 more...)

arXiv.org Artificial Intelligence

2512.0227

Country: North America > United States (1.00)

Genre: Research Report > New Finding (0.54)

Industry:

Transportation > Air (0.48)
Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.46)

Add feedback

Efficient Formal Safety Analysis of Neural Networks

Shiqi Wang, Kexin Pei, Justin Whitehouse, Junfeng Yang, Suman Jana

Neural Information Processing SystemsNov-20-2025, 15:27:09 GMT

Such mistakes can have disastrous and even potentially fatal consequences.

artificial intelligence, machine learning, neural network, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts (0.04)
North America > United States > California (0.04)
North America > Canada > Quebec > Montreal (0.04)

Industry:

Transportation > Ground > Road (0.69)
Automobiles & Trucks (0.69)
Information Technology > Robotics & Automation (0.47)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Dataset Safety in Autonomous Driving: Requirements, Risks, and Assurance

Abbaspour, Alireza, Patil, Tejaskumar Balgonda, Kiran, B Ravi, Mohr, Russel, Yogamani, Senthil

arXiv.org Artificial IntelligenceNov-12-2025

Dataset integrity is fundamental to the safety and reliability of AI systems, especially in autonomous driving. This paper presents a structured framework for developing safe datasets aligned with ISO/PAS 8800 guidelines. Using AI-based perception systems as the primary use case, it introduces the AI Data Flywheel and the dataset lifecycle, covering data collection, annotation, curation, and maintenance. The framework incorporates rigorous safety analyses to identify hazards and mitigate risks caused by dataset insufficiencies. It also defines processes for establishing dataset safety requirements and proposes verification and validation strategies to ensure compliance with safety standards. In addition to outlining best practices, the paper reviews recent research and emerging trends in dataset safety and autonomous vehicle development, providing insights into current challenges and future directions. By integrating these perspectives, the paper aims to advance robust, safety-assured AI systems for autonomous driving applications.

artificial intelligence, dataset, survey article, (16 more...)

arXiv.org Artificial Intelligence

2511.08439

Genre:

Overview (1.00)
Research Report > New Finding (0.46)

Industry:

Transportation > Ground > Road (1.00)
Information Technology > Robotics & Automation (1.00)
Automobiles & Trucks (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)

Add feedback

Floating-Point Neural Network Verification at the Software Level

Manino, Edoardo, Farias, Bruno, Menezes, Rafael Sá, Shmarov, Fedor, Cordeiro, Lucas C.

arXiv.org Artificial IntelligenceOct-28-2025

The behaviour of neural network components must be proven correct before deployment in safety-critical systems. Unfortunately, existing neural network verification techniques cannot certify the absence of faults at the software level. In this paper, we show how to specify and verify that neural networks are safe, by explicitly reasoning about their floating-point implementation. In doing so, we construct NeuroCodeBench 2.0, a benchmark comprising 912 neural network verification examples that cover activation functions, common layers, and full neural networks of up to 170K parameters. Our verification suite is written in plain C and is compatible with the format of the International Competition on Software Verification (SV-COMP). Thanks to it, we can conduct the first rigorous evaluation of eight state-of-the-art software verifiers on neural network code. The results show that existing automated verification tools can correctly solve an average of 11% of our benchmark, while producing around 3% incorrect verdicts. At the same time, a historical analysis reveals that the release of our benchmark has already had a significantly positive impact on the latter.

artificial intelligence, machine learning, verifier, (18 more...)

arXiv.org Artificial Intelligence

2510.23389

Country:

North America > United States (1.00)
Europe (1.00)

Genre: Research Report > New Finding (0.87)

Industry:

Government (0.67)
Information Technology > Security & Privacy (0.45)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

A Verification Methodology for Safety Assurance of Robotic Autonomous Systems

Adam, Mustafa, Anisi, David A., Ribeiro, Pedro

arXiv.org Artificial IntelligenceOct-16-2025

Autonomous robots deployed in shared human environments, such as agricultural settings, require rigorous safety assurance to meet both functional reliability and regulatory compliance. These systems must operate in dynamic, unstructured environments, interact safely with humans, and respond effectively to a wide range of potential hazards. This paper presents a verification workflow for the safety assurance of an autonomous agricultural robot, covering the entire development life-cycle, from concept study and design to runtime verification. The outlined methodology begins with a systematic hazard analysis and risk assessment to identify potential risks and derive corresponding safety requirements. A formal model of the safety controller is then developed to capture its behaviour and verify that the controller satisfies the specified safety properties with respect to these requirements. The proposed approach is demonstrated on a field robot operating in an agricultural setting. The results show that the methodology can be effectively used to verify safety-critical properties and facilitate the early identification of design issues, contributing to the development of safer robots and autonomous systems.

artificial intelligence, requirement, verification, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/978-3-032-01486-3_23

2506.19622

Genre:

Research Report (0.70)
Workflow (0.49)

Industry:

Government (0.35)
Law (0.34)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Add feedback

Bridging Neural ODE and ResNet: A Formal Error Bound for Safety Verification

Sayed, Abdelrahman Sayed, Meyer, Pierre-Jean, Ghazel, Mohamed

arXiv.org Artificial IntelligenceOct-14-2025

A neural ordinary differential equation (neural ODE) is a machine learning model that is commonly described as a continuous-depth generalization of a residual network (ResNet) with a single residual block, or conversely, the ResNet can be seen as the Euler discretization of the neural ODE. These two models are therefore strongly related in a way that the behaviors of either model are considered to be an approximation of the behaviors of the other. In this work, we establish a more formal relationship between these two models by bounding the approximation error between two such related models. The obtained error bound then allows us to use one of the models as a verification proxy for the other, without running the verification tools twice: if the reachable output set expanded by the error bound satisfies a safety property on one of the models, this safety property is then guaranteed to be also satisfied on the other model. This feature is fully reversible, and the initial safety verification can be run indifferently on either of the two models. This novel approach is illustrated on a numerical example of a fixed-point attractor system modeled as a neural ODE.

artificial intelligence, machine learning, neural ode, (16 more...)

arXiv.org Artificial Intelligence

2506.03227

Country: Europe (0.46)

Genre: Research Report (0.70)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

A Safely Imitating a Neural Policy

Neural Information Processing SystemsOct-9-2025, 14:09:55 GMT

Here we provide proofs of the theoretical results from Section 3.2 and extend the discussion of a few

artificial intelligence, benchmark, machine learning, (17 more...)

Neural Information Processing Systems

Industry: Transportation (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.48)

Add feedback

Verifying Memoryless Sequential Decision-making of Large Language Models

Gross, Dennis, Spieker, Helge, Gotlieb, Arnaud

arXiv.org Artificial IntelligenceOct-9-2025

We introduce a tool for rigorous and automated verification of large language model (LLM)- based policies in memoryless sequential decision-making tasks. Given a Markov decision process (MDP) representing the sequential decision-making task, an LLM policy, and a safety requirement expressed as a PCTL formula, our approach incrementally constructs only the reachable portion of the MDP guided by the LLM's chosen actions. Each state is encoded as a natural language prompt, the LLM's response is parsed into an action, and reachable successor states by the policy are expanded. The resulting formal model is checked with Storm to determine whether the policy satisfies the specified safety property. In experiments on standard grid world benchmarks, we show that open source LLMs accessed via Ollama can be verified when deterministically seeded, but generally underperform deep reinforcement learning baselines. Our tool natively integrates with Ollama and supports PRISM-specified tasks, enabling continuous benchmarking in user-specified sequential decision-making tasks and laying a practical foundation for formally verifying increasingly capable LLMs.

large language model, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

2510.06756

Genre: Research Report > New Finding (0.46)

Industry: Transportation (0.70)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.48)

Add feedback

Speculative Safety-Aware Decoding

Wang, Xuekang, Zhu, Shengyu, Cheng, Xueqi

arXiv.org Artificial IntelligenceSep-30-2025

Despite extensive efforts to align Large Language Models (LLMs) with human values and safety rules, jailbreak attacks that exploit certain vulnerabilities continuously emerge, highlighting the need to strengthen existing LLMs with additional safety properties to defend against these attacks. However, tuning large models has become increasingly resource intensive and may have difficulty ensuring consistent performance. We introduce Speculative Safety-Aware Decoding (SSD), a lightweight decoding-time approach that equips LLMs with the desired safety property while accelerating inference. We assume that there exists a small language model that possesses this desired property. SSD integrates speculative sampling during decoding and leverages the match ratio between the small and composite models to quantify jailbreak risks. This enables SSD to dynamically switch between decoding schemes to prioritize utility or safety, to handle the challenge of different model capacities. The output token is then sampled from a new distribution that combines the distributions of the original and the small models. Experimental results show that SSD successfully equips the large model with the desired safety property, and also allows the model to remain helpful to benign queries. Furthermore, SSD accelerates the inference time, thanks to the speculative sampling design.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2508.17739

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.97)

Add feedback